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Qy 1 TCCGTTCTCCGACATCGAGCAGGCCGAGCGCGCCCTGGCCGCCTGGGAGGTGCCGCATGC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 83 TCCGTTCTCCGACATCGAGCAGGCCGAGCGCGCCCTGGCCGCCTGGGAGGTGCCGCATGC 142 

Qy 61 TTGAGCCTCTTCGCCTCAGCCAGTTGACGGTCGCGCTGGACGCCCGCCTGATCGGCGAGG 120 

I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 143 TTGAGCCTCTTCGCCTCAGCCAGTTGACGGTCGCGCTGGACGCCCGCCTGATCGGCGAGG 202 

Qy 121 ACGCCGTCTTTTCGGCGGTTTCCACCGACAGTCGCGCCATCGGGCCCGGCCAACTGTTCA 18 0 

I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 203 ACGCCGTCTTTTCGGCGGTTTCCACCGACAGTCGCGCCATCGGGCCCGGCCAACTGTTCA 2 62 

Qy 181 TTGCCCTGAGTGGGCCGCGTTTCGACGGCCACGACTATCTCGCCGAGGTTGCCGCCAAGG 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 63 TTGCCCTGAGTGGGCCGCGTTTCGACGGCCACGACTATCTCGCCGAGGTTGCCGCCAAGG 322 

Qy 241 GCGCGGTGGCTGCGCTGGTGGAGCGCGAAGTCGCCGACGCGCCCTTGCCGCAATTGCTGG 300 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 323 GCGCGGTGGCTGCGCTGGTGGAGCGCGAAGTCGCCGACGCGCCCCTGCCGCAATTGCTGG 382 

Qy 301 TGCGCGATACCCGTGCGGCCCTGGGGCGACTGGGCGCGCTGAACCGGCGCAAGTTCACCG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 38 3 TGCGCGATACCCGTGCGGCCCTGGGGCGACTGGGCGCGCTGAACCGGCGCAAGTTCACCG 4 42 

Qy 361 GCCCGCTGGCGGCCATGACGGGCTCCAGCGGCAAGACCGCGGTCAAGGAGATGCTCGCCA 420 

I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 43 GCCCGCTGGCGGCCATGACGGGCTCCAGCGGCAAGACCACGGTCAAGGAGATGCTCGCCA 502 



Qy 

Db 



421 
503 



GCATCCTGCGTACCCAGGCCGGCGATGCCGAGTCGGTGCTGGCTACCCGTGGCAATCTGA 4 8 0 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCATCCTGTGTACCCAGGCCGGCGATGCCGAGTCGGTGCTGGCTACCCGTGGCAATCTGA 5 62 
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781 
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863 


Qy 


841 
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923 


Qy 


901 


Db 


983 


Qy 


961 


Db 


1043 


Qy 


1021 


Db 


1103 


Qy 


1081 


Db 


1163 


Qy 


1141 


Db 


1223 


Qy 


1201 


Db 


1283 


Qy 


1261 


Db 


1343 



ACAACGACCTCGGCGTACCGCTGACCCTGCTGCAACTGGCGCCGCAGCACCGTAGCGCAG 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M 1 I 1 I I I I I I I 

ACAACGACCTCGGCGTACCGCTGACCCTGCTGCAACTGGCGCCGCAGCACCGTAGCGCAG 622 

TGATCGAACTGGGCGCCTCGCGCATCGGCGAGATCGCCTACACGGTCGAGCTGACCCGCC 600 

| | | I I I I I I I I I I I I I I I I I 1 I I 1 I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I 

TGATCGAACTGGGCGCCTCGCGCATCGGCGAGATCGCCTACACGGTTGAGCTGACCCGCC 682 

CGCACGTGGCGATCATCACCAATGCCGGAACCGCCCATGTCGGCGAGTTCGGCGGACCGG 660 
| | I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGCACGTGGCGATCATCACCAATGCCGGAACCGCCCATGTCGGCGAGTTCGGCGGACCGG 7 42 

AGAAGATCGTCGAGGCGAAGGGCGAGAT ACTCGAAGGGCTGGCCGCCGACGGCACCGCCG 720 

I | | | | | M | I I I I I I I I I I I I I I I II I II I I I I I I I I I II I I II I I I I I I I M I I I M I I 

AGAAGATCGTCGAGGCGAAGGGCGAGAT ACTCGAAGGGCTGGCCGCCGACGGCACCGCCG 802 

TACTGAACCTGGACGACAAGGCCTTCGACACCTGGAAGGCCCGTGCCAGCGGCCGTCCGT 7 80 

j | I I I I I I I I II I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

TGCTGAACCTGGACGACAAGGCCTTCGACACCTGGAAGGCCCGTGCCAGCGGCCGTCCGT 8 62 

TGCTGACTTTCTCCCTCGACCGGCCCCAGGCCGATTTCCGCGCCGCCGATCTGCAGCGCG 84 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TGCTGACTTTCTCCCTCGACCGGCCCCAGGCCGATTTCCGCGCCGCCGATCTGCAGCGCG 922 

ATGCGCGCGGCTGCATGGGCTTCAGGCTGCAGGGCGTAGCGGGTGAAGCGCAGGTCCAGC 900 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ATGCGCGTGGCTGCATGGGCTTCAGGCTGCAGGGCGTAGCGGGTGAAGCGCAGGTCCAGC 982 

TCAACCTGCTGGGGCGGCACAATGTCGCCAATGCCCTGGCTGCGGCCGCTGCCGCCCATG 960 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCAACCTGCTGGGGCGGCACAATGTCGCCAATGCCCTGGCTGCCGCCGCTGCCGCCCATG 1042 

CACTGGGCGTGCCGCTGGATGGGATCGTCGCCGGGCTGCAGGCGCTGCAGCCGGTCAAGG 1020 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

CACTGGGCGTGCCGCTGGATGGGATCGTCGCCGGGCTGCAGGCGCTGCAGCCGGTCAAGG 1102 

GCCGCGCGGT AGCGCAACTGACCGCCAGCGGGCTGCGTGTGATAGACGACAGCTACAACG 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCCGCGCGGTAGCGCAACTGACCGCCAGCGGGCTGCGTGTGATAGACGACAGCTACAACG 1162 

CCAACCCCGCGTCAATGCTGGCGGCGATTGATATACTGAGCGGCTTTTCCGGGCGCACCG 114 0 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I 

CCAACCCCGCGTCAATGCTGGCGGCGATTGATATACTGAGCGGCTTTTCCGGGCGCACCG 1222 
TCCTGGTCCTCGGAGACATGGGCGAACTCGGTTCCTGGGCCGAGCAGGCCCACCGCGAGG 1200 

I I II I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I II II 

TCCTGGTCCTCGGAGACATGGGCGAACTCGGTTCCTGGGCCGAGCAGGCCCACCGCGAGG 128 2 
TGGGCGCCTACGCCGCTGGCAAGGTGTCCGCGCTCTATGCGGTCGGACCGCTGATGGCCC 12 60 

I I I I I II I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

TGGGCGCCT ACGCCGCTGGCAAGGTGTCCGCGCTCTATGCGGTCGGACCGCTGATGGCCC 134 2 
ACGCCGTACAGGCGTTCGGCGCCACGGGCCGGCACTTCGCCGACCAGGCCAGCCTGATCG 1320 

I II I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

ACGCCGTACAGGCGTTCGGCGCCACGGGCCGGCACTTCGCCGACCAGGCCAGCCTGATCG 14 02 



1321 GGGCGCTGGCCACCGAACAACCGACAACCACCATTTTGATCAAGGGTTCCCGCAGTGCGG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

14 03 GGGCGCTGGCCACCGAACAACCGACAACCACCATTTTGATCAAGGGTTCCCGCAGTGCGG 

1381 CGATGGACAAAGTCGTCGCGGCGCTGTGCGGTTCCTCCGAGGAGAGTCACTAA 14 33 

I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I 

14 63 CGATGGACAAAGTCGTCGCGGCGCTGTGCGGTTCCTCCGAGGAGAGTCACTAA 1515 
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070-778-2 (1-458) x US-09-252-991A-7701 (1-1515) 

1 MetLeuGluProLeuArgLeuSerGlnLeuThrValAlaLeuAspAlaArgLeuIleGly 20 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
139 ATGCTTGAGCCTCTTCGCCTCAGCCAGTTGACGGTCGCGCTGGACGCCCGCCTGATCGGC 198 

21 GluAspAlaValPheSerAlaValSerThrAspSerArgAlalleGlyProGlyGlnLeu 4 0 
I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I 
199 GAGGACGCCGTCTTTTCGGCGGTTTCCACCGACAGTCGCGCCATCGGGCCCGGCCAACTG 258 

41 1 PhelleAlaLeuSerGlyProArgPheAspGlyHisAspTyrLeuAlaGluValAlaAla 60 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 

259 TTCATTGCCCTGAGTGGGCCGCGTTTCGACGGCCACGACTATCTCGCCGAGGTTGCCGCC 318 

61 LysGlyAlaValAlaAlaLeuValGluArgGluValAlaAspAlaProLeuProGlnLeu 80 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
319 AAGGGCGCGGTGGCTGCGCTGGTGGAGCGCGAAGTCGCCGACGCGCCCCTGCCGCAATTG 37 8 

81 LeuValArgAspThrArgAlaAlaLeuGlyArgLeuGlyAlaLeuAsnArgArgLysPhe 100 
I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I ! I 1 I I I I I I I I I I I I I I I I I I I I I I 

37 9 CTGGTGCGCGATACCCGTGCGGCCCTGGGGCGACTGGGCGCGCTGAACCGGCGCAAGTTC 4 38 

101 ThrGlyProLeuAlaAlaMetThrGlySerSerGlyLysThrAlaValLysGluMetLeu 120 

I I I I I I II I I I I I I I I I I I I I I I I I I I ! I I E I I I I ! I I I I I I I I I I I I I I I II I I I I 
4 39 ACCGGCCCGCTGGCGGCCATGACGGGCTCCAGCGGCAAGACCACGGTCAAGGAGATGCTC 4 98 

121 AlaSerlleLeuArgThrGlnAlaGlyAspAlaGluSerValLeuAlaThrArgGlyAsn 14 0 

I I I I I I I I I I I I I I I I I I II I I I II I II I I I I I I I I I I I I I II I I I I I I II I I I II I 
4 99 GCCAGCATCCTGTGTACCCAGGCCGGCGATGCCGAGTCGGTGCTGGCTACCCGTGGCAAT 558 

141 LeuAsnAsnAspLeuGlyValProLeuThrLeuLeuGlnLeuAlaProGlnHisArgSer 160 
I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

55 9 CTGAACAACGACCTCGGCGTACCGCTGACCCTGCTGCAACTGGCGCCGCAGCACCGTAGC 618 

161 AlaVallleGluLeuGlyAlaSerArglleGlyGluIleAlaTyrThrValGluLeuThr 180 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I 

619 GCAGTGATCGAACTGGGCGCCTCGCGCATCGGCGAGATCGCCTACACGGTTGAGCTGACC 67 8 

181 ArgProHisValAlallelleThrAsnAlaGlyThrAlaHisValGlyGluPheGlyGly 2 00 
I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

67 9 CGCCCGCACGTGGCGATCATCACCAATGCCGGAACCGCCCATGTCGGCGAGTTCGGCGGA 738 

201 ProGluLysIleValGluAlaLysGlyGluIleLeuGluGlyLeuAlaAlaAspGlyThr 22 0 
I I I I M I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I 

739 CCGGAGAAGATCGTCGAGGCGAAGGGCGAGATACTCGAAGGGCTGGCCGCCGACGGCACC 7 98 
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Db 


979 
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Db 


1039 


Qy 


321 


Db 


1099 


Qy 


341 


Db 


1159 


Qy 


361 


Db 


1219 


Qy 


381 


Db 


1279 


Qy 


401 


Db 


1339 


Qy 


421 


Db 


1399 


Qy 


441 


Db 


1459 



AlaValLeuAsnLeuAspAspLysAlaPheAspThrTrpLysAlaArgAlaSerGlyArg 24 0 

I I I I I I I I II I I I II I I I I I I I I I I I I I II I I I I I M I I I I i I 

GCCGTGCTGAACCTGGACGACAAGGCCTTCGACACCTGGAAGGCCCGTGCCAGCGGCCGT 858 

ProLeuLeuThrPheSerLeuAspArgProGlnAlaAspPheArgAlaAlaAspLeuGln 260 

Ill I I I I I I I I I I I I I I I I I M I I I II I 

CCGTTGCTGACTTTCTCCCTCGACCGGCCCCAGGCCGATTTCCGCGCCGCCGATCTGCAG 918 

ArgAspAlaArgGlyCysMetGlyPheArgLeuGlnGlyValAlaGlyGluAlaGlnVal 280 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCGATGCGCGTGGCTGCATGGGCTTCAGGCTGCAGGGCGTAGCGGGTGAAGCGCAGGTC 97 8 

GlnLeuAsnLeuLeuGlyArgHisAsnValAlaAsnAlaLeuAlaAlaAlaAlaAlaAla 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAGCTCAACCTGCTGGGGCGGCACAATGTCGCCAATGCCCTGGCTGCCGCCGCTGCCGCC 1038 

HisAlaLeuGlyValProLeuAspGlylleValAlaGlyLeuGlnAlaLeuGlnProVal 320 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CATGCACTGGGCGTGCCGCTGGATGGGATCGTCGCCGGGCTGCAGGCGCTGCAGCCGGTC 1098 

LysGlyArgAlaValAlaGlnLeuThrAlaSerGlyLeuArgVallleAspAspSerTyr 34 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AAGGGCCGCGCGGTAGCGCAACTGACCGCCAGCGGGCTGCGTGTGATAGACGACAGCTAC 1158 



I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



360 
1218 



ThrValLeuValLeuGlyAspMetGlyGluLeuGlySerTrpAlaGluGlnAlaHisArg 380 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

ACCGTCCTGGTCCTCGGAGACATGGGCGAACTCGGTTCCTGGGCCGAGCAGGCCCACCGC 127 8 



I I I I I I I I II II I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 



AlaHisAlaValGlnAlaPheGlyAlaThrGlyArgHisPheAlaAspGlnAlaSerLeu 420 

I I I I I I I I I I I I I I I I I I I I I I II I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCCCACGCCGTACAGGCGTTCGGCGCCACGGGCCGGCACTTCGCCGACCAGGCCAGCCTG 1398 

IleGlyAlaLeuAlaThrGluGlnProThrThrThrlleLeuIleLysGlySerArgSer 4 40 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

ATCGGGGCGCTGGCCACCGAACAACCGACAACCACCATTTTGATCAAGGGTTCCCGCAGT 14 58 

AlaAlaMetAspLysValValAlaAlaLeuCysGlySerSerGluGluSerHis 4 58 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

GCGGCGATGGACAAAGTCGTCGCGGCGCTGTGCGGTTCCTCCGAGGAGAGTCAC 1 51 2 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



March 24, 2004, 01:45:30 ; Search time 594 Seconds 

(without alignments) 
10363.029 Million cell updates/sec 

US-10-070-778-1 
1449 

1 tccgttctccgacatcgagc ctaatgctcctgctgctggc 1449. 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



3373863 seqs, 2124099041 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



6747726 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



S ft 

Ik 



d a) A q/g£&6 4 fa, 



N Geneseq_2 9Jan04 : * 

1 : geneseqnl980s : * 

2 : geneseqnl990s : * 

3: geneseqn2000s : * 

4: geneseqn2001as : * 

5 : geneseqn2001bs : * 

6 : geneseqn2002s : * 

7: geneseqn2003as : * 

8: geneseqn2003bs : * 

9: geneseqn2003cs : * 
10: geneseqn2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



) . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1449 


100 


.0 


1449 


4 


AAS01762 


Aas01762 


Pseudomon 


2 


1372.2 


94 


.7 


1377 


4 


AAS54277 


Aas54277 


Pseudomon 


3 


1372.2 


94 


.7 


1377 


7 


ACA42600 


Aca42600 


Prokaryot 


4 


695.6 


4 8 


.0 


1365 


7 


ACA4 4 2 61 


Aca44261 


Prokaryot 


5 


692.4 


47 


.8 


1356 


• 7 


ACA45660 


Aca45660 


Prokaryot 


6 


317 


21 


.9 


2835 


7 


ACA27387 


Aca27387 


Prokaryot 


7 


296 


20 


. 4 


1404 


7 


ACA2 62 93 


Aca26293 


Prokaryot 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 



March 24, 2004, 03:40:40 ; Search time 133 Seconds 

(without alignments) 
6046.044 Million cell updates/sec 

US-10-070-778-1 
1449- 

1 tccgttctccgacatcgagc ctaatgctcctgctgctggc 1449 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 682709 seqs, 277475446 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1365418 



Database : 



Issued_Patents_NA: * 

/cgn2_6/ptodata/2/ina/5A_COMB. seq: * 
/cgn2_6/ptodata/2/ina/5B_COMB. seq: * 
/cgn2__6/ptodata/2/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/2/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/2/ina/PCTUS_COMB. seq: * 
/cgn2_6/ptodata/2/ina/backf ilesl . seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


1 


1421. 8 


98.1 


1515 


4 


US-09-252-991A-7701 


c 2 


1079.8 


7 4.5- 


1215 


4 


US-09-252-991A-7862 


3 


338 


23.3 


1404 


4 


US-09-252-991A-7623 


4 


288.8 


19.9 


1437 


4 


US-09-489-039A-5205 


5 


268.6 


18.5 


1360 


2 


US-08-732-612-1 


c 6 


214 


14.8 


1611 


4 


US-09-252-991A-7930 


c 7 


139.8 


9.6 


4411529 


3 


US-09-103-840A-1 


c 8 


137.2 


9.5 


4403765 


3 


US-09-103-840A-2 


9 


134 


9.2 


1419 


4 


US-09-543-681A-2913 


10 


123.8 


8.5 


1830121 


4 


US-09-557-884-1 


11 


123.8 


8.5 


1830121 


4 


US-09-643-990A-1 



Description 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 



7701, Ap 
7862, Ap 
7623, Ap 
5205, Ap 
1, Appli 
7 930, Ap 

1, Appli 

2, Appli 
2913, Ap 

1, Appli 
1, Appli 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



March 24, 2004, 05:32:38 



Search time 528 Seconds 

(without alignments) 

10151.654 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-070-778-1 

1449 

1 tccgttctccgacatcgagc 



ctaatgctcctgctgctggc 1449 



Scoring table: 



IDENTITY_NUC 
Gapop 10.0 , 



Gapext 1.0 



Searched: 



2438257 seqs, 1849576744 residues 



Total number of hits satisfying chosen parameters: 



4876514 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



Published_Applications_NA: * 



1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB. seq: * 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW__PUB.seq: * 

3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 

6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB. seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB. seq: * 

10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB. seq: * 

11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 

12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB. seq: * 

13: /cgn2_6/ptodata/2/pubpna/US'10A_PUBCOMB. seq: * 

14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB. seq: * 

15 : /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq: * 

16: /cgn2_6/ptodata/2/pubpna/US10_NEW__PUB.seq: * ' 

17 : /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq: * 

18 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



Slo. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1372 


.2 


OA 


, / 


1377 


9 


US-09-815-242-7914 


Sequence 7 914, Ap 


2 


1372 


.2 


Q/1 


, / 


1377 


12 


US-10-282-122A-304 7 0 


Sequence 30470, A 


3 


695 


. 6 


A Q 


, u 


1365 


12 


US-10-282-122A-32131 


Sequence 32131, A 


4 


692 


.4 


A 1 


p 


1356 


12 


US-10-282-122A-33530 


Sequence 33530, A 


5 


317 


9 1 


Q 


2835 


12 


US- 10-282- 122A- 15257 


Sequence 15257, A 


6 


296 


9 n 
z u . 


, fi 


1404 


12 


US-10-282-122A-14163 


Sequence 14163, A 


7 


290 


. 6 


9 n 
z u , 


, ± 


1356 


12 


US-10-282-122A-23403 


Sequence 23403, A 


8 


278 


1 Q 


o 
, Z 


1359 


9 


US-09-741-669-257 


Sequence 2 57, App 


9 


278 




9 
. <£. 


1359 


9 


US-09-815-242-5930 


Sequence 5930, Ap 


10 


278 


1 Q 


, z 


1359 


12 


US-10-282-122A-20227 


Sequence 20227, A 


11 


252 


1 7 

_L / . 


. y 


1356 


12 


US- 10-28 2 -122A- 19658 


Sequence 19658, A 


12 


247 


.2 


1 7 


i 

. ± 


1359 


12 


US-10-282-122A-39833 


Sequence 39833, A 


13 


246 


.2 


1 1 
± i < 




1359 


9 


US-09-815-242-9976 


Sequence 9976, Ap 


14 


244 


.2 


1 0 . 


q 


1410 


12 


US-10-282-122A-12920 


Sequence 12920, A 


15 


244 


1 6 


p 


1359 


12 


US-10-282-122A-38999 


Sequence 38999, A 


16 


218 


1 R 


n 


1362 


12 


US-10-282-122A-41276 


Sequence 41276, A 


17 


217 


. 6 


1 R 


n 
. u 


1158 


12 


US-10-282-122A-11387 


Sequence 11387, A 


18 


204 


. 4 


1 4 




1364 


12 


US-10-282-122A-37 04 8 


Sequence 37048, A 


19 


194 


. 6 


1 ? 

-L O « 


A 


1377 


12 


US-10-282-122A-41735 


Sequence 41735, A 


20 


192 




9 
, J) 


1413 


14 


US-10-156-761-6095 


Sequence 6095, Ap 


21 


192 


1 9 


9 


9025608 


14 


US-10-156-761-1 


Sequence 1, Appli 


22 


187 


. 4 


1 9 
1 Z , 


q 


1308 


12 


US-10-282-122A-28948 


Sequence 28948, A 


23 


184 


.2 


1 9 
J.Z . 


~j 
> 1 


1359 


12 


US-10-282-122A-29881 


Sequence 29881, A 


24 


139 


.8 


Q 
z? , 


> u 


1533 


9 


US-09-712-363-81 


Sequence 81, Appl 


25 


139 


.8 


Q 
-7 . 


. 0 


1533 


12 


US-1 0-282- 122A-28471 


Sequence 28471, A 


26 


138 


.8 


q 




1647 


12 


US-10-282-122A-28781 


Sequence 28781, A 


27 


129 


.4 


Q 

O , 


q 


1380 


12 


US-10-282-122A-32631 


Sequence 32631, A 


28 


126 


D 

o . 


7 


1383 


12 


US-10-282-122A-30711 


Sequence 30711, A 


29 


123 


.8 


Q 

o « 




1374 


9 


US-09-815-242-7088 


Sequence 7088, Ap 


30 


123 


. 8 


P 

o « 


s 


1374 


12 


US- 10-282- 122A-22234 


Sequence 22234, A 


31 


123 


. 8 


D 

0 . 


, 3 


1830121 


14 


US-10-329-960-1 


Sequence 1, Appli 


32 


123 


. 8 


Q 

O . 


, 3 


1830121 


15 


US-10-329-670-1 


Sequence 1, Appli 


33 


94 


. 6 


f, 

o , 


, O 


1482 


12 


US- 10-282- 122A-2 687 3 


Sequence 26873, A 


34 


85 


. 6 


5 , 


, 9 


1488 


12 


US-10-282-122A-178 4 3 


Sequence 17843, A 


35 


82 


. 6 


5. 


,7 


1398 


12 


US-10-282-122A-8531 


Sequence 8531, Ap 


36 


81 


.8 


5. 


. 6 


981 


12 


US-10-282-122A-25880 


Sequence 25880, A 


37 


77 


.2 


5. 


,3 


1518 


14 


US-10-156-761-6094 


Sequence 6094, Ap 


38 


73 


.4 


5. 


,1 


1542 


9 


US-09-738-626-2369 


Sequence 2369, Ap 


39 


73 


. 4 


5. 


1 


3309400 


9 


US-09-738-626-1 


Sequence 1, Appli 


40 


69 


.4 


4 . 


8 


15738 


14 


US-10-329-079-12 


Sequence 12, Appl 


41 


69 


. 4 


4 . 


8 


37360 


14 


US-10-329-079-6 


Sequence 6, Appli 


42 


69 


.2 


4 . 


8 


894 


12 


US-10-282-122A-117 87 


Sequence 11787, A 


43 


68 


4 . 


7 


708 


12 


US-10-282-122A-12357 


Sequence 12357, A 


44 


67 


. 6 


4 . 


7 


822 


14 


US-10-156-761-3410 


Sequence 3410, Ap 


45 


66 


. 4 


4 . 


6 


77536 


10 


US-09-940-316B-1 


Sequence 1, Appli 



ALIGNMENTS 



RESULT 1 

US-09-815-242-7914 



GenCore version 5.1.6 
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OM nucleic - nucleic search, using sw model 



Run on: 



March 24, 2004, 03:38:31 



Search time 3537 Seconds 

(without alignments) 

12233.616 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-070-778-1 
1449 

1 tccgttctccgacatcgagc. 



. ctaatgctcctgctgctggc 1449 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



27513289 seqs, 14931090276 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



55026578 



Database 



EST: * 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 



em_estba : * 
em_esthum: * 
em_estin : * 
em__estmu : * 
em_estov : * 
em_estpl : * 
em^estro : * 
em_htc : * 
gb__estl : * 
gb_est2:* 
gb_htc : * 
gb_est3 : * 
gb_est4 : * 
gb_est5 : * 
em_est f un : * 
em_estom: * 
em_gss_hum: 
em_gss_inv: 
em_gss_pln: 
em_gss_vrt : 
em_gss_fun: 
em_gss_mam: 
em_gss_mus: 
em_gss_pro : 
em_gss__rod: 
em_gss_phg : 
em_gss_vrl : 



28: gb_gssl:* 
29: gb_gss2:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


610.2 


A O 

4 Z . 


, 1 


1108 


28 


BZ559252 


BZ559252 


pacs2-164 


c 


2 


566.8 


3 9 . 


. 1 


1214 


28 


BZ572325 


BZ572325 


msh2_2585 




3 


441.8 






1215 


28 


BZ575690 


BZ575690 


msh2 4579 


c 


4 


416 


o o 
ZD . 


. / 


666 


28 


BZ572149 


BZ572149 


msh2_2530 




5 


407.8 


Zo . 


. 1 


1026 


28 


BZ559351 


BZ559351 


pacs2-164 


c 


6 


149.8 


1 U . 


o 
. J 


838 


28 


CC115454 


CC115454 


NDL. 42F3. 


c 


7 


145.2 


1U . 


n 
, U 


685 


28 


CC140248 


CC140248 


NDL. 42F4 . 


c 


8 


127.6 


Q 

O . 


o 
. 0 


684 


28 


AQ968325 


AQ968325 


LERJB56TR 




9 


127 .2 


o 

o , 


o 

, 0 


817 


29 


CNS01IUM 


AL146223 


Anopheles 


c 


10 


125.8 


Q 

O - 


, 1 


346 


10 


BF837914 


BF837914 


RC0-HT095 




11 


88.2 


b . 


, 1 


1285 


28 


BZ575700 


BZ575700 


msh2 4583 


c 


12 


80.6 


r; 

0 . 


. b 


508 


9 


AV629852 


AV629852 AV629852 


c 


13 


74.6 


5 . 


. 1 


925 


29 


CNS0091P 


AL053013 


Drosophil 




14 


68.8 


A 

4 . 


, / 


925 


29 


CNS0091P 


AL053013 


Drosophil 




15 


68.8 


A 

4 . 


n 

, 1 


1610 


28 


BZ569386 


BZ569386 


pacs2-164 




16 


67.4 


A 

4 . 


1 


982 


13 


BX415111 


BX415111 


BX415111 




17 


67 .2 


A 

4 . 


b 


1189 


29 


AG030608 


AG030608 


Pan trogl 




18 


67 .2 


A 

4 . 


b 


1201 


13 


BX405071 


BX405071 


BX405071 




19 


64 .2 


A 


4 


1798 


29 


AG171124 


AG171124 


Pan trogl 




20 


63.6 


A 

4 . 


4 


893 


28 


AZ193531 


AZ193531 


SP_1023_B 


c 


21 


63.6 


A 

4 . 


A 

4 


1610 


28 


BZ569386 


BZ569386 


pacs2-164 


c 


22 


63.4 


A 

4 . 


4 


911 


29 


CC704337 


CC704337 


OGUGX11TV 




23 


63.4 


4 . 


4 


915 


29 


CG352123 


CG352123 


OGXBA55TV 


c 


24 


63.2 


A 

4 . 


4 


1189 


29 


AG030608 


AG030608 


Pan trogl 




25 


63 


A 

4 . 


o 


998 


29 


AG046208 


AG046208 


Pan trogl 


c 


26 


62 


A 

4 . 


3 


1421 


28 


BZ569488 


BZ569488 


pacs2-164 


c 


27 


61.8 


A 

4 . 


3 


1134 


29 


AG061098 


AG061098 


Pan trogl 




28 


60.8 


4 . 


2 


494 


12 


BG455820 


BG455820 


NF070E07P 


c 


29 


60.6 


4 . 


2 


1015 


28 


BZ569259 


BZ569259 


pacs2-164 




30 


60.4 


4 . 


2 


564 


29 


CG224720 


CG224720 


OG2AT66TV 


c 


31 


60.4 


4 . 


2 


663 


29 


CG245199 


CG245199 


OG0DZ87TH 




32 


60.4 


4. 


2 


1542 


29 


AG032943 


AG032943 


Pan trogl 


c 


33 


59.6 


4 . 


1 


911 


29 


AG043617 


AG043617 


Pan trogl 




34 


59.2 


4 . 


1 


1145 


29 


AG082667 


AG082667 


Pan trogl 




35 


59 


4. 


1 


706 


14 


CF243689 


CF243689 


3530_1_23 




36 


59 


4. 


1 


849 


29 


CG343694 


CG343694 


OGWIQ64TV 


c 


37 


58.8 


4 . 


1 


1637 


28 


BZ572329 


BZ572329 


msh2 2586 


c 


38 


58 . 6 


4 . 


0 


807 


29 


CG271592 


CG271592 


OG3CW52TH 




39 


58.6 


4. 


0 


831 


29 


CC722906 


CC722906 


OGKAU13TH 




40 


58.6 


4. 


0 


835 


14 


CB904973 


CB904973 


tric040xn 




41 


58.6 


4 . 


0 


835 


14 


CF876206 


CF876206 


tric040xn 




42 


58 . 6 


4 . 


0 


935 


29 


CG271603 


CG271603 


OG3CW52TV 


c 


43 


58.6 


4 . 


0 


1093 


13 


BQ939664 


BQ939664 


AGENCOURT 


c 


44 


58.4 


4 . 


0 


1956 


29 


CG754548 


CG754548 


P050-1-A1 




45 


58 .2 


4 . 


0 


562 


29 


CG055749 


CG055749 


PUICM11TB 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



March 24, 2004, 02:36:25 ; Search time 5617 'Seconds 

(without alignments) 
11181.068 Million cell updates/sec 

US-10-070-778-1 
1449 

1 tccgttctccgacatcgagc ctaatgctcctgctgctggc 1449 

IDENTITY_NUC 

Gapop 10. .0 , Gapext 1.0 



Searched : 



3470272 seqs, 21671516995 residues 



Total number of hits satisfying chosen parameters: 



6940544 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



GenEmbl : * 



1: 
2: 
3: 
4 ; 
5: 
6: 
7 : 
8: 
9: 
10: 
11: 
12: 
13: 
14 : 
15: 
16: 
17: 
18: 
19: 
20: 
21: 
22: 
23: 
24 : 
25: 
26: 
27: 



gb_ba : * 
gb_htg: * 
gb_in: * 
gb_om : * 
gb_ov : * 
gb_pat : * 
gb__ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 
gb_sts : * 
gb_sy : * 
gb_un : * 
gb_vi : * 
em_ba : * 
em_f un : * 
em_hum : * 
em in : * 
em_mu : * 
em_om : * 
em_or : * 
em_ov : * 
emjpat : * 
em_ph : * 
em_pl : * 
emro : * 
em sts : * 



28 


em_ 


un: * 


29 


em 


vi : * 


30 


em 


htg hum:* 


31 


em 


htg inv:* 


32 


em 


htg other :* 


33 


em 


htg mus : * 


34 


em_ 


_htg_pln: * 


35 


em_ 


_htg_rod: * 


36 


em 


htg mam: * 


37 


em 


_htg_vrt : * 


38 


em 


sy : * 


39 


em 


_htgo hum:* 


40 


em 


htgo mus : * 


41 


em 


htgo other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


1449 


100 


. 0 


5280 


1 


AY008276 


AY008276 


Pseudomon 


c 


2 


1444.2 




. / 


24000 


1 


AE004856 


AE004856 


Pseudomon 




3 


734 


50 


m 7 


301995 


1 


AE016779 


AE016779 


Pseudomon 


c 


4 


732 


50 


.5 


311600 


1 


AE016871 


AE016871 


Pseudomon 


c 


5 


390.2 


26 


. 9 


208524 


1 


AE016925 


AE016925 


Chromobac 


c 


6 


323.4 


22 


.3 


344805 


1 


BX640434 


BX640434 


Bordetell 


c 


7 


320.2 


22 


. 1 


348074 


1 


BX640449 


BX640449 


Bordetell 


c 


8 


317 


21 


. 9 


348134 


1 


BX640420 


BX640420 


Bordetell 




9 


308.6 


21 


.3 


13045 


1 


AE012171 


AE012171 


Xanthomon 


c 


10 


296.8 


20 


. 5 


188050 


1 


AL646072 


AL646072 


Ralstonia 




11 


288.8 


19 


. 9 


1437 


6 


AR388476 


AR388476 


Sequence 




12 


281.2 


19 


. 4 


11441 


1 


AE015046 


AE015046 


Shigella 




13 


281.2 


19 


. 4 


290029 


1 


AE016978 


AE016978 


Shigella 




14 


279.6 


19 


. 3 


1800 


1 


ECU67891 


U67891 Escherichia 




15 


278 


19 


.2 


1359 


6 


AX189056 


AX189056 


Sequence 




16 


278 


19 


.2 


1800 


1 


ECU67893 


U67893 Escherichia 




17 


278 


19 


.2 


12791 


1 


AE005185 


AE005185 


Escherich 




18 


278 


19 


.2 


21757 


1 


AE000118 


AE000118 


Escherich 




19 


278 


19 


.2 


281530 


1 


AP002550 


AP002550 


Escherich 




20 


276.4 


19 


. 1 


1491 


1 


ECMURF 


X15432 E 


coli mur 




21 


276.4 


19 


. 1 


28277 


6 


AX191720 


AX191720 


Sequence 




22 


276.4 


19 


. 1 


28277 


15 


EC2MIN 


X55034 E. coli 2 m 




23 


276.4 


19 


1 


111408 


1 


ECO110K 


D10483 Escherichia 




24 


275.6 


19 


0 


13079 


1 


AE011708 


AE011708 


Xanthomon 


c 


25 


275.6 


19 


0 


212737 


2 


AC147207 


AC147207 


Silurana 




26 


266.8 


18 


4 


300409 


1 


AE016755 


AE016755 


Escherich 




27 


247.2 


17 


1 


251050 


1 


AL627265 


AL627265 


Salmonell 




28 


247 .2 


17 


1 


300169 


1 


AE016834 


AE016834 


Salmonell 




29 


244 


16 


8 


22348 


1 


AE008699 


AE008699 


Salmonell 


c 


30 


218 


15. 


0 


22201 


1 


AE004310 


AE004310 


Vibrio ch 


c 


31 


214.8 


14 . 


8 


69301 


1 


STU82965 


U82965 Streptomcye 




32 


213.4 


14 . 


7 


301727 


1 


AE016960 


AE016960 


Coxiella 




33 


213.4 


14. 


7 


316050 


1 


BX321859 


BX321859 


Nitrosomo 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: March 24, 2004, 06:34:44 ; Search time 2793 Seconds 

(without alignments) 
4896.842 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-10-070-778-2 

2262 

1 MLEPLRLSQLTVALDARLIG RSAAMDKVVAALCGSSEESH 458 



BLOSUM62 
Xgapop 10.0 
Ygapop 10.0 
Fgapop 6 . 0 
Delop 6.0 



Xgapext 0 . 5 

Ygapext 0 . 5 

Fgapext 7 . 0 

Delext 7.0 



Searched: 



27513289 seqs, 14931090276 residues 



Total number of hits satisfying chosen parameters: 



55026578 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlp 

Q=/cgn2_l/USPTO_spool_p/US10070778/runat_2 3032004_141617_6672/app_query. fasta 
64 7 

-DB=EST -QFMT=fastap -SUFFIX=p2n . rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN-200 -THR_SCORE=pct -THR_MAX-100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US10070778_@CGN_l_l_4237_@runat__23032004_141617_6672 -NCPU-6 -ICPU-3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEVJTIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP- 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database 



EST: 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 



em_estba : * 
em_esthum: 1 
em_estin : * 
em_estmu : * 
em_estov : * 
em_estpl : * 
em_estro : * 
em_htc: * 
gb_estl : * 
gb est2:* 



11 


: gb 


htc : * 




12 


: gb 


_est3: * 




13 


: gb 


est4 : * 




14 


• gb_ 


~est5: * 




15 


em 


_estfun: * 




16 


: em 


estom: * 




17 


em 


_gss hum: 




18 


ein_ 


gss inv: 


* 


19 


em 


_gss_pln : 


+ 


20 


em 


_gss_vrt : 


■k 


21 


em 


gss fun: 


* 


22 


em 


gss mam: 




23 


em 


gss mus : 


* 


24 


em 


gss pro: 




25 


em 


gss rod: 


★ 


26 


em 


gss phg: 




27; 


em 


_gss_vrl : 




28: 


gb" 


gssl : * 




29: 


gb_ 


"gss2: * 





Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

g, 
o 



Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


1074 


47 


5 


1108 


28 


BZ559252 


BZ559252 


pacs2-164 


c 


2 


950.5 


42 


0 


1214 


28 


BZ572325 


BZ572325 


msh2_2585 




3 


779.5 


34 


5 


1215 


28 


BZ575690 


BZ575690 


msh2 4579 


c 


4 


691.5 


30 


6 


666 


28 


BZ572149 


BZ572149 


msh2 2530 




5 


646 


28. 


6 


1026 


28 


BZ559351 


BZ559351 


pacs2-164 


c 


6 


434 


19. 


2 


838 


28 


CC115454 


CC115454 


NDL. 42F3. 


c 


7 


423.5 


18. 


7 


684 


28 


AQ968325 


AQ968325 


LERJB56TR 


c 


8 


403 


17 . 


8 


685 


28 


CC140248 


CC140248 


NDL.42F4 . 




9 


384 


17. 


0 


817 


29 


CNS01IUM 


AL146223 


Anopheles 




10 


279.5 


12. 


4 


623 


14 


CB095990 


CB095990 


ie98f06.b 


c 


11 


273.5 


12. 


1 


346 


10 


BF837914 


BF837914 


RC0-HT095 


c 


12 


210 


9. 


3 


6303 


28 


BH771012 


BH771012 


LLMGtag73 




13 


208.5 


9. 


2 


494 


12 


BG455820 


BG455820 


NF070E07P 




14 


185 


8. 


2 


740 


12 


BI727187 


BI727187 


1031090D0 


c 


15 


184.5 


8. 


2 


1445 


28 


BZ568959 


BZ568959 


pacs2-164 




16 


168.5 


7. 


4 


646 


12 


BM000662 


BM000662 


1031090DO 




17 


160.5 


7 . 


1 


437 


28 


AZ303140 


AZ303140 


GSSBru208 


c 


18 


149.5 


6. 


6 


1289 


29 


CG746186 


CG746186 


P039-2-D0 




19 


148 


6. 


5 


786 


28 


BH541774 


BH541774 


BOGWV4 3TF 


c 


20 


146 


6. 


5 


1100 


14 


CK214645 


CK214645 


FGAS02657 




21 


145 


6. 


4 


597 


10 


AW200691 


AW200691 


se92cl0.y 


c 


22 


144 


6. 


4 


987 


28 


BZ568302 


BZ568302 


pacs2-164 




23 


141.5 


6. 


3 


3721 


11 


BC032414 


BC032414 


Homo sapi 




24 


138.5 


6. 


1 


481 


28 


AZ048545 


AZ048545 


GSSBru067 


c 


25 


134 


5 . 


9 


508 


9 


AV629852 


AV629852 AV629852 




26 


133.5 


5. 


9 


564 


14 


CA938683 


CA938683 


sav36d04 . 


c 


27 


133.5 


5. 


9 


697 


14 


CF887565 


CF887565 


UI-CF-FN0 




28 


133.5 


5. 


9 


739 


13 


BQ852657 


BQ852657 


QGB18J24 . 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



Title: 



Seguence : 



March 24, 2004, 07:49:39 ; Search time 4484 Seconds 

(without alignments) 
• 4427.098 Million cell updates/sec 

US-10-070-778-2 



Perfect score: 2262 



1 MLEPLRLSQLTVALDARLIG RSAAMDKVVAALCGSSEESH 4 58 



Scoring table: 



BLOSUM62 
Xgapop 10 
Ygapop 10 
Fgapop 6 
Delop 6 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



3470272 seqs, 21671516995 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



6940544 



Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlp 

Q=/cgn2_l/USPTO_spool_p/US10070778/runat_23032004 141616 6662/app query. fasta 
647 

-DB=GenEmbl -QFMT=fastap -SUFFIX=p2n . rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX-100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN-0 -MAXLEN=2000000000 
-USER=US10070778_@CGN_l_l_52 65_@runat_23032004_141616__6662 -NCPU=6 -ICPU-3 
-N0J4MAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK-100 -LONGLOG 
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP= 
-FGAPEXT=7 -YGAPOP-10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : 



GenEmbl: * 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 



gb_ba : * 
gb_htg: * 
gb_in : * 
gb_om : * 
gb_ov : * 
gb_pat : * 
gb_ph : * 
gb__pl : * 
gb_pr : * 
gb_ro : * 



11: 


gb sts : * 


12: 


gb_sy : * 


13: 


gb un : * 


14: 


gb_vi : * 


15: 


em ba : * 


16: 


em fun : * 


17: 


em hum: * 


18: 


em in : * 


19: 


emjnu : * 


20: 


em om : * 


21: 


em or : * 


22: 


em ov:* 


23: 


em pat : * 


24 : 


em_ph : * 


25: 


em pi : * 


26: 


em ro:* 


27: 


em sts : * 


28: 


em un : * 


29: 


em vi : * 


30: 


em htg hum: * 


31: 


em htg inv : * 


32: 


em_htg other:* 


33: 


em htg mus : * 


34: 


em htg pin: * 


35: 


em htg rod: * 


36: 


em htg mam: * 


37: 


em htg vrt : * 


38: 


em sy : * 


39: 


em htgo hum: * 


40: 


em_htgo_mus : * 


41: 


em_htgo_other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



to. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


2262 


100, 


.0 


5280 


1 


AY008276 


AY008276 Pseudomon 


2 


2258 


99. 


. 8 


24000 


1 


AE004856 


AE004856 Pseudomon 


3 


1631 


72. 


. 1 


311600 


1 


AE016871 


AE016871 Pseudomon 


4 


1603.5 


70, 


. 9 


301995 


1 


AE016779 


AE016779 Pseudomon 


5 


962 


42. 


.5 


301727 


1 


AE016960 


AE016960 Coxiella 


6 


921.5 


40. 


.7 


11441 


1 


AE015046 


AE015046 Shigella 


7 


921.5 


40, 


, 7 


290029 


1 


AE016978 


AE016978 Shigella 


8 


917.5 


40. 


. 6 


1359 


6 


AX189056 


AX189056 Sequence 


9 


917.5 


40. 


, 6 


1800 


1 


ECU67891 


U6.7891 Escherichia 


10 


917.5 


40. 


6 


21757 


1 


AE000118 


AE000118 Escherich 


11 


913.5 


40. 


. 4 


1800 


1 


ECU67893 


U67893 Escherichia 


12 


911.5 


40. 


3 


12791 


1 


AE005185 


AE005185 Escherich 


13 


911.5 


40. 


3 


281530 


1 


AP002550 


AP002550 Escherich 


14 


910.5 


40. 


3 


1491 


1 


ECMURF 


X15432 E. coli mur 


15 


910.5 


40. 


3 


28277 


6 


AX191720 


AX191720 Sequence 


16 


910.5 


40. 


3 


28277 


15 


EC2MIN 


X55034 E. coli 2 m 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



March 24, 2004, 06:31:50 ; Search time 488 Seconds 

(without alignments) 
3987.038 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-070-778-2 
2262 

1 MLEPLRLSQLTVALDARLIG. . I RSAAMDKVVAALCGSSEESH 458 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7.0 



Searched: 



3373863 seqs, 2124099041 residues 



Total number of hits satisfying chosen parameters: 



6747726 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlp 

Q=/cgn2_l/USPTO_spool_p/US1007077 8/runat__23032004_141616_6654/app_query. fasta 
64 7 . 

-DB=N_Geneseq_2 9Jan04 -QFMT=fastap -SUFFIX=p2n . rng -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-M0DE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US1007 0778_(3CGN_l_l_708_@runat_23032004_141616_6654 -NCPU=6 -ICPU=3 
-N0_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_T1MEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP-10 -XGAPEXT=0 . 5 -FGAPOP- 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT-0 . 5 -DELOP=6 -DELEXT=7 

Database : N_Geneseq_2 9 Jan04 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3 : geneseqn2000s : * 

4 : geneseqn2001as : * 

5: geneseqn2001bs : * 

6: geneseqn2002s : * 

7: geneseqn2003as : * 

8: geneseqn2003bs : * 

9: geneseqn2003cs : * 

10 : geneseqn2004s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 




Description 


1 


2262 


inn 

100 , 


. 0 


1449 


4 


AAS01762 




Aas01762 


Pseudomon 


2 


2258 


Q Q 

y y . 


Q 

. o 


1377 


4 


AAS54277 




Aas54277 


Pseudomon 


3 


2258 


y y . 


Q 

. o 


1377 


7 


ACA4 2600 




Aca42600 


Prokaryot 


4 


1616 


"7 1 


A 

. H 


1356 


7 


ACA45660 




Aca45660 


Prokaryot 


5 


1603.5 


*7 n 


Q 

. y 


1365 


7 


ACA44261 




Aca44261 


Prokaryot 


6 


917.5 


a n 
4 U . 


. D 


1359 


4 


AAS52293 




Aas52293 


E. coli D 


7 


917.5 


a n 




1359 


5 


AAH81458 




Aah81458 


Escherich 


8 


917.5 


a n 

H U . 




1359 


7 


ACA32357 




Aca32357 


Prokaryot 


9 


885.5 


j y , 


, 1 


1360 


2 


AAT63360 




Aat63360 


Temperatu 


10 


885.5 


jy , 


. 1 


1360 


2 


AAX86095 




Aax86095 


Nucleotid 


11 


882.5 


jy . 


. U 


1356 


7 


ACA31788 




Aca31788 


Prokaryot 


12 


880.5 


^ Q 
JO . 


Q 

. y 


1359 


7 


ACA5112 9 




Aca51129 


Prokaryot 


13 


879.5 


"3 o 
JO , 


. y 


1356 


7 


ACA35533 




Aca35533 


Prokaryot 


14 


875.5 


^ Q 
JO . 


n 
, / 


1362 


7 


ACA53406 




Aca53406 


Prokaryot 


15 


874 


T Q 
JO , 


. b 


1404 


7 


ACA2 62 93 




Aca26293 


Prokaryot 


16 


872.5 


"3 Q 
JO . 


. b 


1380 


7 


ACF71680 




Acf71680 


Photorhab 


17 


872.5 


JO . 


. b 


110000 


7 


ACF67367 


48 


Continuation (49 o 


18 


872.5 


"3 Q 
JO . 


. b 


110000 


7 


ACF65387 


[o 


Acf 65387 


Photorhab 


19 


871.5 


q o 
JO , 


. 3 


1359 


7 


ACA51963 




Aca51963 


Prokaryot 


20 


870.5 


q Q 
JO , 


c 
. J 


1359 


4 


AAS56339 




Aas56339 


Salmonell 


21 


857 


q "7 

J / , 


Q 

, y 


1377 


7 


ACA53865 




Aca53865 


Prokaryot 


22 


828 


J O , 


. b 


1410 


7 


ACA25050 




Aca25050 


Prokaryot 


23 


819 


J D . 


n 
. z 


1380 


7 


ACA44761 




Aca44761 


Prokaryot 


24 


812 


JJ , 


. y 


1374 


4 


AAS53451 




Aas53451 


Haemophil 


25 


812 


J J . 


, y 


1374 


7 


ACA34364 




Aca34364 


Prokaryot 


26 


812 


q R 
JJ . 


. y 


110000 


2 


AAT42063 


11 


Continuation (12 o 


27 


812 


J J , 


. y 


110000 


2 


AAT42063 


~12 


Continuation (13 o 


28 


795.5 


J J , 


. z 


2835 


7 


ACA27387 




Aca27387 


Prokaryot 


29' 


783.5 


J T i 


. 6 


1482 


7 


ACA39003 




Aca39003 


Prokaryot 


30 


783.5 


34, 


. 6 


100848 


4 


AAF28552 




Aaf28552 


Genomic f 


31 


777.5 


34 . 


.4 


1383 


7 


ACA42841 




Aca42841 


Prokaryot 


c 32 


775.5 


34 . 


.3 


69936 


3 


AAA81479 




Aaa81479 


N. mening 


33 


775.5 


34 , 


.3 


110000 


3 


AAA81490_ 


04 


Continuation (5 of 


34 


775.5 


34 . 


,3 


349980 


3 


AAF21607 




Aaf21607 


Neisseria 


35 


772.5 


34 . 


.2 


1368 


3 


AAZ53149 




Aaz53149 


Neisseria 


36 


760 


33. 


.6 


1359 


3 


AAZ53148 




Aaz53148 


Neisseria 


37 


757 


33, 


,5 


1359 


3 


AAZ53150 




Aaz53150 


Neisseria 


38 


757 


33, 


.5 


1359 


7 


ACA42011 




Aca42011 


Prokaryot 


39 


756 


33, 


, 4 


1364 


7 


ACA4 9178 




Aca49178 


Prokaryot 


40 


744 


32. 


, 9 


1308 


7 


ABZ41552 




Abz41552 


N. gonorr 


41 


744 


32. 


, 9 


1308 


7 


ACA41078 




Aca41078 


Prokaryot 


42 


731 


32, 


,3 


1119 


6 


ABQ90030 




Abq90030 


M. capsul 


43 


729.5 


32, 


,3 


1452 


8 


ADA30607 




Ada30607 


DNA encod 


44 


726.5 


32. 


, 1 


1398 


7 


ACA20661 




Aca20661 


Prokaryot 


45 


695 


30. 


,7 


1158 


7 


ACA23517 




Aca23517 


Prokaryot 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: March 24, 2004, 08:13:16 ; Search time 111 Seconds 

(without alignments) 
2289.797 Million cell updates/sec 



Title: 



Sequence : 



Scoring table: 



Searched: 



US-10-070-778-2 



Perfect score: 2262 



1 MLEPLRLSQLTVALDARLIG RS AAMDKVVAALCGS S EESH 4 58 

BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7.0 
Delop 6.0 , Delext 7.0 

682709 seqs, 277475446 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1365418 



Post -processing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlp 

Q=/cgn2_l/USPTO_spool_p/US1007 077 8/runat_23032004_141617_6682/app_query.fasta_ 
64 7 . 

-DB=Issued_Patents_NA -QFMT-fastap -SUFFIX=p2n . rni -MINMATCH-0 . 1 -LOOPCL-0 
-LOOPEXT=0 -UNITS=bits -START-1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX-100 -THR_MIN=0 -ALIGN=15 
-M0DE=LOCAL -OUTFMT=pto -NORM-ext -HEAPSIZE=500 -MINLEN=0 -MAXLEKN2000000000 
-USER=US1007077 8_@CGN_l_l_8 5_@runat_23032004_141617_6682 -NCPU-6 -ICPU=3 
-N0_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT-120 -WARNJTIMEOUT=30 -THREADS-1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP-6 -DELEXT-7 



Database : 



Issued Patents NA: 
1 
2 
3 
4 
5 
6 



/cgn2_6/ptodata/2/ina/5A_COMB. seq: * 
/cgn2_6/ptodata/2/ina/5B__COMB. seq: * 
/cgn2_6/ptodata/2/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/2/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/2/ina/PCTUS_COMB. seq: * 
/cgn2__6/ptodata/2/ina/backf ilesl . seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of. the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

o 
o 



Result Query 





No. 


Score 


Match 


Length ] 


DB 


ID 








Description 




1 


2250 


QQ 
J y . 


c 
9 


1515 


4 


US- 


09- 


252- 


991A-7701 


O i^f 1 T f~\ ir\ /~> r\ 

0 ecjuence 


7 7 01 An 


c 


2 


1691 


7 A 
1 4 . 


Q 

o 


1215 


4 


us- 


09- 


252- 


991A-7862 


O /-\ s~*f 1 t s~\ y^t r\ 

sequence 


7 ft £9 J\r~\ 
1 O DZ , rip 




3 


885.5 


9 Q 

j y . 


1 


1360 


2 


us- 


08- 


732- 


612-1 


Sequence 


1 , Hpp_Ll 




4 


876.5 


9ft 


7 


1437 


4 


us- 


09- 


489- 


039A-5205 




JZ U J / -"P 




5 


821.5 


9 £ 


o 
J 


1419 


4 


US- 


09- 


543- 


681A-2913 


O •» -1 /-s y\ s-v 

oequence 


9 Q 1 9 Z\t~i 
Z -/ J. 9 f tiy 




6 


812 


9 ^ 
9 9 . 


q 


1830121 


4 


US 


-09 


-557 


-884-1 


O s~\ /^ri T y^k r-s 

oGquGiice 


1 A -r^r\ 1 

: 1 , /\pp_L . 




7 


812 


9 ^ 
J O . 


Q 


1830121 


4 


us 


-09 


-643 


-990A-1 




1 f Appl . 




8 


784.5 


9 A 

9 4 . 


1 
1 


1506 


4 


us- 


09- 


540- 


236-1209 


Sequence 


1 9 PI Q 7\r-\ 




9 


783.5 


9 A 
04 . 


0 


100848 


4 


us- 


09- 


596- 


002-39 


O r^i 1 t r~\ tt\ /~-t t~\ 

0 e qut3iicG 


9iJ, /ippj. 




10 


729.5 


9 9 


Zi 

9 


1452 


4 


us- 


09- 


328- 


352-1894 


O t v-s r-* /-\ 

0 equericti 


1 ft QA Tlt~\ 


c 


11 


687 


9 C\ 
9U . 


/I 


640681 


4 


us- 


09- 


790- 


988-1 


Sequence 


1, Appli 




12 


593 


9 £ 
Z D . 


9 
Z 


6911 


1 


us- 


08- 


311- 


174-4 


S equence 


4 , ^PP J- 1 


c 


13 


574 


9 c: 


/I 


4411529 


3 


us 


-09 


-103 


-840A-1 


Sequence 


! 1 , App 1 . 


c 


14 


572 


9 c; 
ZO . 


J 


4403765 


3 


us 


-09 


-103 


-840A-2 


Sequence 


: Z , -ApP -L - 




15 


551.5 


9 A 

Z 4 . 


4 


7363 


4 


us- 


08- 


956- 


171E-19 


Sequence 


ly f Appl 




16 


544.5 


9 A 
Z 4 . 


1 


1400 


3 


us- 


09- 


144- 


918-1 


Sequence 


1, Appli 




17 


536.5 


9 9 
Z 9 . 


/ 


1374 


4 


us- 


09- 


134- 


001C-1447 


Sequence 


1 / A 7 7\t^ 
14 4/, Ap 




. 18 


528 


9 9 
Z J . 


7 


1404 


4 


us- 


09- 


252- 


991A-7623 


Sequence 


7^99 An 

/ 0 Z J , Ap 




19 


465.5 


9 p 
ZU . 


b 


1452 


4 


us- 


09- 


134- 


000C-1457 


Sequence 


1 /I C "7 At-. 

14 0/, Ap 




20 


459. 5 


9 Pi 
Z U . 


7 

9 


1847 


4 


us- 


08- 


961- 


527-196 


Sequence 


1 y 0 , App 




21 


452.5 


9 Pi 
Z U . 


U 


1702. . 


3 


us- 


09- 


143- 


954-1 


Sequence 


1 Ann 1 -i 

1 , App 11 




22 


447.5 


i y . 


0 
0 


1676 


3 


us- 


09- 


143- 


954-3 


Sequence 


3, Appli 




23 


■ 420.5 


i 0 


0 


1455 


4 


us- 


09- 


107- 


532A-356 


Sequence 


O E, £~ Ann 

J DO, ^-PP 




24 


416 


i ft 


/I 


1230025 


4 


us 


-09 


-198 


-452A-1 


oequence 


1 A nn T 

: 1 , Appi. 


c 


25 


276 


1 9 
1 Z . 


z 


1611 


4 


US- 


09- 


252- 


991A-7930 


oequence 


7 Q 9 H 7\t\ 
1 y 9 U , Ap 




26 


273 


1 9 
1 z . 


1 


1335 


4 


US- 


09- 


252- 


991A-7551 


ot3qut3rice 


7 C C 1 An 
/ O O X f t\\J 




27 


271 


1 9 
1 Z . 




1401 


4 


US- 


09- 


252- 


991A-7622 


oequence 


/ D Z Z , /-ip 


c 


28 


264.5 


1 1 . 


9 


1230025 


4 


US 


-09 


-198 


-452A-1 


Sequence 


1 A nn 1 
! X r App 1 . 




29 


263 


1 1 


0 


1094 


3 


US- 


09- 


144- 


918-3 


Sequence 


9, Appli 




30 


261.5 


1 1 


0 


1509 


4 


US- 


09- 


489- 


039A-5144 


oequence 


C "1 A A An 

014 4, Ap 




31 


255 


1 1 


-D 

9 


1509 


4 


us- 


09- 


543- 


681A-2855 


Sequence 


OQ r C An 

Z 0 0 0 , Ap 




32 


246 


1 Pi 
1U . 


y 


5757 


3 


us- 


08- 


984- 


618-1 


Sequence 


1, Appli 




33 


245 


1 PI 

J. u . 


Q 
O 


1416 


4 


us- 


09- 


328- 


352-1282 


octjucIlLc 


1 9R? An 




34 


241 


10. 


7 


1542 


4 


us- 


09- 


252- 


991A-7625 


Sequence 


7625, Ap 


c 


35 


241 


10. 


7 


1701 


4 


us- 


09- 


252- 


991A-7927 


Sequence 


7 927, Ap 




36 


232 


10. 


3 


1371 


4 


us- 


09- 


252- 


991A-7702 


Sequence 


7702, Ap 


c 


37 


232 


10. 


3 


1401 


4 


us- 


09- 


252- 


991A-7861 


Sequence 


7861, Ap 




38 


231 


10. 


2 


2600 


3 


us- 


08- 


988- 


251-1 


Sequence 


1, Appli 




39 


231 


10. 


2 


2600 


3 


us- 


09- 


386- 


048-1 


Sequence 


1, Appli 




40 


231 


10. 


2 


12658 


4 


us- 


08- 


956- 


171E-127 


Sequence 


127, App 




41 


227.5 


10. 


1 


1503 


4 


us- 


09- 


328- 


352-2140 


Sequence 


2140, Ap 




42 


224 


9. 


9 


1503 


4 


us- 


09- 


134- 


001C-1153 


Sequence 


1153, Ap 




43 


220 


9. 


7 


1572 


4 


us- 


09- 


540- 


236-1219 


Sequence 


1219, Ap 




44 


212 


9. 


4 


2193 


2 


us- 


08- 


934- 


481-1 


Sequence 


1, Appli 




45 


212 


9. 


4 


2193 


4 


us- 


09- 


290- 


602-1 


Sequence 


1, Appli 



ALIGNMENTS 



RESULT 1 

US-09-252-991A-7701 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



March 24, 2004, 09:10:24 



Search time 2940 Seconds 

(without alignments) 

576.263 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-070-778-2 
2262 

1 MLEPLRLSQLTVALDARLIG 



RSAAMDKVVAALCGSSEESH 4 58 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



2438257 seqs, 1849576744 residues 



Total number of hits satisfying chosen parameters: 



4876514 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+jp2n. model -DEV=xlp 

Q=/cgn2_l/USPTO_spool_p/US1007 077 8/runat_23032004_141619_674 6/app_query. fasta_ 
64 7 

-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=p2n . rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 
-TRANS=human4 0.cdi -LIST-45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 -MODE-LOCAL -OUTFMT=pto -NORM-ext -HEAPSIZE=500 -MINLEN=0 
-MAXLEN-2000000000 -USER=US 1007077 8_@CGN_l_l_333_@runat_23032004_14 1619_67 4 6 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 . 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT-30 -THREADS=1 -XGAPOP-10 -XGAPEXT=0 . 5 
-FGAP0P=6 -FGAPEXT-7 -YGAPOP=10 -YGAPEXT-0 . 5 -DELOP=6 -DELEXT=7 

Database : Published_Applicat ions_NA : * 



1: /cgn2_6/ptodata/l/pubpna/US07_PUBCOMB. seq: * 

2: /cgn2_6/ptodata/l/pubpna/PCT_NEW_PUB.seq: * 

3: /cgn2_6/ptodata/l/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/l/pubpna/US06_PUBCOMB.seq: * 

5: /cgn2_6/ptodata/l/pubpna/US07_NEW_PUB.seq: * 

6: /cgn2_6/ptodata/l/pubpna/PCTUS__PUBCOMB. seq: * 

7 : /cgn2_6/ptodata/l/pubpna/US08_NEW_PUB. seq: * 

8 : /cgn2_6/ptodata/l/pubpna/US08_PUBCOMB . seq: * 

9: /cgn2_6/ptodata/l/pubpna/US09A_PUBCOMB.seq: * 

10: /cgn2_6/ptodata/l/pubpna/US09B_PUBCOMB.seq: * 



11 : /cgn2_6/ptodata/l/pubpna/US09C_PUBCOMB. seq: * 

12: /cgn2_6/ptodata/l/pubpna/US09_NEW_PUB. seq: * 

1 3 : /cgn2_6 /pt oda t a / 1 /pubpna /US 1 0A_PUBCOMB . s eq : * 

14 : /cgn2_6/ptodata/l/pubpna/US10B_PUBCOMB. seq: * 

15: /cgn2_6/ptodata/l/pubpna/US10C_PUBCOMB.seq: * 

16: /cgn2_6/ptodata/l/pubpna/US10_NEW_PUB. seq: * 

17 : /cgn2_6/ptodata/l/pubpna/US60_NEW_PUB. seq: * 

1 8 : /cgn2_6/pt odata / 1 /pubpna/US 60_PUBCOMB . seq : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



•Result 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


2258 




Q 
. o 


1377 


9 


US-09-815-242-7914 


Sequence 7914, Ap 


2 


2258 


QQ 


o 

. 0 


1377 


12 


US-10-282-122A-30470 


Sequence 30470, A 


3 


1616 


1 1 


4 


1356 


12 


US-10-282-122A-33530 


Sequence 33530, A 


4 


1603.5 


70 


9 


1365 


12 


US-10-282-122A-32131 


Sequence 32131, A 


5 


917.5 


40 


. 6 


1359 


9 


US-09-741-669-257 


Sequence 257, App 


6 


917.5 


40 




1359 


9 


US-09-815-242-5930 


Sequence 5930, Ap 


7 


917.5 


4 n 


a 


1359 


12 


US-10-282-122A-20227 


Sequence 20227, A 


8 


882.5 


39 


. 0 


1356 


12 


US-10-282-122A-19658 


Sequence 19658, A 


9 


880.5 


o o 


q 


1359 


12 


US-10-282-122A-38999 


Sequence 38999, A 


10 


879.5 


■j <_> 


Q 
• ~J 


1356 


12 


US-10-2'8 2-122A-234 03 


Sequence 23403, A 


11 


875.5 


38 


m 7 


1362 


12 


US-10-2 82-122A-4127 6 


Sequence 41276, A 


12 


874 


o o 


. o 


1404 


12 


US-10-282-122A-14163 


Sequence 14163, A 


13 


871.5 


38 


. 5 


1359 


12 


US-10-282-122A-39833 


Sequence 39833, A 


14 


870.5 


3 0 


. D 


1359 


9 


US-09-815-242-9976 


Sequence 997 6, Ap 


• 15 


857 


37 


. 9 


1377 


12 


US-10-282-122A-41735 


Sequence 41735, A 


16 


828 


36 


. 6 


1410 


12 


US-10-282-122A-12 920 


Sequence 12920, A 


17 


■ 819 


36 


.2 


1380 


12 


US-10-282-122A-32 631 


Sequence 32631, A 


18 


812 


35 


. 9 


1374 


9 


US-09-815-242-7088 


Sequence 7088, Ap 


19 


812 


35 


. 9 


1374 


12 


US-10-282-122A-22234 


Sequence 22234, A 


20 


812 


35 


. 9 


1830121 


14 


US-10-329-960-1 


Sequence 1, Appli 


21 


812 


35 


. 9 


1830121 


15 


US-10-329-670-1 


Sequence 1, Appli 


22 


795.5 


35 


.2 


2835 


12 


US-10-282-122A-152 57 


Sequence 15257, A 


23 


783.5 


34 


. 6 


1482 


12 


US-10-282-122A-26873 


Sequence 26873, A 


24 


777.5 


34 


. 4 


1383 


12 


US-10-2 8 2-122A-30711 


Sequence 30711, A 


25 


757 


33 


.5 


1359 


12 


US-10-282-122A-29881 


Sequence 29881, A 


26 


756 


33 


. 4 


1364 


12 


US-10-282-122A-37048 


Sequence 37048, A 


27 


744 


32 


. 9 


1308 


12 


US-10-282-122A-28948 


Sequence 28948, A 


28 


726.5 


32 


. 1 


1398 


12 


US-10-282-122A-8531 


Sequence 8531, Ap 


29 


702.5 


31 


.1 


9025608 


14 


US-10-156-761-1 


Sequence 1, Appli 


30 


701.5 


31 


0 


1413 


14 


US-10-156-761-6095 


Sequence 6095, Ap 


31 


695 


30 


.7 


1158 


12 


US-10-28 2-122A-1138 7 


Sequence 11387, A 


c 32 


687 


30 


A 


640681 


9 


US-09-790-988-1 


Sequence 1, Appli 


33 


618 


27 


3 


1542 


9 


US-09-738-626-2369 


Sequence 2369, Ap 


c 34 


618 


27 


3 


3309400 


9 


US-09-738-626-1 


Sequence 1, Appli 


35 


604.5 


26 


7 


1488 


12 


US-10-282-122A-1784 3 


Sequence 17843, A 


36 


572.5 


25 


3 


1374 


12 


US-10-282-122A-9891 


Sequence 98 91, Ap 


37 


569 


25 


2 


1533 


9 


US-09-712-363-81 


Sequence 81, Appl 


38 


569 


25 


2 


1533 


12 


US-10-282-122A-28471 


Sequence 28471, A 


39 


556.5 


24. 


6 


1353 


9 


US-09-815-242-4300 


Sequence 4300, Ap 



